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ABSTRACT 

We report on the limitations of sky subtraction accuracy for long duration fibre-optic 
multi-object spectroscopy of faint astronomical sources during long duration expo- 
sures. We show that while standard sky subtraction techniques yield accuracies con- 
sistent with the Poisson noise limit for exposures of 1 hour duration, there are large 
scale systematic defects that inhibit the sensitivity gains expected on the summa- 
tion of longer duration exposures. For the AAOmega system at the Anglo- Australian 
Telescope we identify a limiting systematic sky subtraction accuracy which is reached 
after integration times of 4-10 hours. We show that these systematic defects can be 
avoided through the use of the fibre nod-and-shuffle observing mode, but with poten- 
tial cost in observing efficiency. Finally we demonstrate that these disadvantages can 
be overcome through the application of a Principle Components Analysis sky sub- 
traction routine. Such an approach minimise systematic residuals across long duration 
exposures allowing deep integrations. 

We apply the PCA approach to over 200 hours of on-sky observations and conclude 
that for the AAOmega system the residual error in long duration observations falls at 
a rate proportional to t~ - 32 in contrast to the r~ - 5 rate expected from theoretical 
considerations. With this modest rate of decline, the PCA approach represents a more 
efficient mode of observation than the nod- and- shuffle technique for observations in 
the sky limited regime with durations of 10-100 hours (even before accounting for the 
additional signal-to-noise and targeting efficiency losses often associated with the N+S 
technique) . 

This conclusion has important implications for the observing strategies of the 
next generation of fibre-optics redshift surveys with existing facilities as well as design 
implications for fibre-optic systems destined for new facilities. It argues against the 
use of the inherently inefficient nod-and-shuffle technique for faint object fibre-optic 
survey spectroscopy. 

Key words: methods: data analysis, methods: observational, techniques: image pro- 
cessing, instrumentation: spectrographs 



1 INTRODUCTION 



High multiplex multi-object spectrographs have been key- 
to many core astronomical investigations since their rou- 
tine operation began over three decades ago. How- 
ever, even with the great advances made in our un- 
derstanding of the Universe via large scale spectro- 
scopic surveys (such as those previously under t aken at 
the A AT wit h the 2dF instrument: 2dFGRS - ICollessI 
(Il999h. 2QZ -iBovle. Shanks. Croom etal\ d2000T) Wig gleZ 
iDrinkwater etali (|2009h . GAM A - iDriver et aU ij2QQ% ) , or 
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closer to home with stellar surveys (such as the UK ST RAVE 
project - ISteinmetz. Zwitter. Siebert etali ((200y)), there re- 
mains an ever increasing desire for more powerful survey 
machines. As astronomy enters the era of the Extremely 
Large Telescopes (ELT) it is appropriate to pause and as- 
sess the techniques and methodologies available to modern 
instrumentation in order to fully utilise the valuable photons 
gathered by these gargantuan machines. 

In the same vein, older facilities strive to identify new 
modes of operation. These stalwarts of the past two decades 
of research must find the best modes of operation to sup- 
port the ambitions of the new generation of facilities, whose 
availability will be restricted at best. It is critical that we 
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take time to review the options and identify the roles each 
instrument must play to maximise the overall return on our 
significant investments, both material and intellectual. 

There are many subtleties to the process of spectro- 
graph design and that discipline is not considered here (for 
an introduction see iHearnshawl (|2009r )) but the problem 
of MOS system design is in essence that of identifying the 
most expedient reformatting of the telescope focal plan 
in order to deliver dispersed light to the detector system. 
Historically three methods have been employed, objective 
prism surveys, focal plane slit-mask slit-spectroscopy and 
fibre-optic systems. In any high multiplex spectroscopic 
system the correct mode of observations is governed by a 
number of competing survey demands: 

Target Multiplex - Maximising the total number of science 
targets which can be simultaneously observed. 

Signal-to-noise - Minimising the required integration time 
to reach a predefined signal-to-noise level within the 
elements of each spectrum. 

Spectral quality - Minimising the systematic errors within 
each spectrum. 

Systematic defects can of course be considered as an ad- 
ditional noise source. However while noise associated with 
a Poisson process, such as the rate of photon arrival at the 
detector, can be suppressed as the square root of the obser- 
vation time, systematic defects are typically removed at a 
slower rate. 

In this paper we focus on the use of fibre-optic systems 
and specifically on an optimal strategy for the accurate sub- 
traction of the OH air-glow lines which plague optical spec- 
troscopy in the wavelength region above ~6000A. Inaccurate 
subtraction of the OH lines is the dominant source of sys- 
tematic error in sky limited fibre spectroscopy at the red 
end of the optical spectrum. There are many discussions of 
the OH problem in th e literature and we refer t he re ader 
to the recent review of lEllis fc Bland-Hawthorn! (|2008r ). In 
this work we consider the merits of two novel techniques 
alongside the traditional dedicated sky fibre method. We 
investigate the use of the nod- and- shuffle mode of observa- 
tion (jGlazebrook fc Bland-Hawthornl200l[ ) coupled with the 
use of a high multiplex fibre-optic multi-object spectroscopy 
system. We also investigate the application of a Princi- 
ple Components Analysis (PCA ) to improve s k y sub trac- 
tion, following the tec hniques of iKurtz fc M ink ( 200 0J) and 
IWild fc Hewetj l|2005r ) as implemented by IParkinson et al\ 
I in prep.1) and outlined in £|2.3I We use data sets from the 
AAOmega facility at the 3.9m Anglo-Australian Telescope 
(AAT) to demonstrate the results. 



which project to ~2 arcsec at the focal plane of the 2dF 
prime focus corrector. 

Observations are shown from the red arm of the dual 
beam AAOmega system, using the 385R Volume Phase 
Holographic (VPH) grating. Three data sets from two pro- 
grams are used for the purposes of illustration. Details 
are given in Table [l] In the first inst ance moderate sig - 
nal spectra from the GAMA project IjDriver etali ;2009) 
are used. Secondly, in order to track long term systematic 
error trends ~200 hours of low signal, essentially sky lim- 
ited, observations from the WiggleZ dark energy project 
IjDrinkwater etali 120091 ') are used. Finally to demonstrate 
the nod-and-shuffle technique, a series of deep N+S frames, 
taken as part of the completeness estimation process for the 
WiggleZ project, are used. Data were taken using both the 
5700A and 6700A dichroic beam splitters. Example spectra 
are shown in units of CCD counts converted to electrons via 
the measured gain parameter. 



2.1 Data processing 

The data used in the following analysis has been processed 
using the 2dfdr data analysis package. All data was 
processed using the standard settings for low resolution 
extra-galactic data. Sky subtraction has been performed 
using three techniques. 

Dedicated sky fibres - Twenty-five fibres where allocated to 
known blank sky positions. The spectra from these fibres 
are median combined to create a master sky spectrum. This 
master sky spectrum is then scaled for subtraction from 
each science spectrum using the OH lines. 

Nod-and-shuffl e (N+S) - Observations where per formed 
in this mode IjGlazebrook fc Bland-Hawthornl 12001! ) using 
a pair of fibres allocated to each science target (Cross 
Beam Switching, CBS). These observation are processed 
in a manner similar to standard AAOmega data with 
the exception that sky subtraction is achieved by di- 
rect subtraction of the raw CCD pixel intensities for the 
A and B telescope nod positions prior to spectral extraction. 

Principle components analysis (PCA) - The final method 
used has been to perform the standard dedicated sky 
fibres reduction but once complete the data was processed 
using the a Principle Components Analysis sky subtrac- 
tion routine as impleme nted for the GAMA survey by 
IParkinson etali (|in prepl) . 

The sky spectrum recorded during each integration is 
used as the normalisation when considering the residual sky 
subtraction error in any given data set. 



2 OBSERVATIONAL DATA SETS 

The example spectra throughout this wor k where taken 
with the the AAOme ga spectrograph (jSaunders et al\ 
|2004r ). ISharp etal\ \200$ \) at the Anglo-Australia Telescope 
(AAT). The fibre feed from the 2dF robotic positioner to the 
AAOmega spectrograph uses 140±2 /im core optical fibre [j 

1 Polymicro FBP140168196 - Broadband Polyimide coated 
140±2 /im core, 168±2/^m, cladding 196±3/xm buffer. 



2.2 Overview of nod-and-shuffle 

The nod-and-shuffle technique has been shown to pro- 
vide Poisson limited performance i n slit spectroscopy 
IjGlazebrook fc Bland-Hawthornl 120011 ) with perhaps the 
best known example of the results achievable being the deep 
spectroscopy of high redshif t early-type galaxies from the 
Gemini Deep Deep Survey (| Abraham et al.l I2004T ). A nod- 
and-shuffle observing mode for fibre-optic MOS had been 
provided with the 2dF system at the AAT but the sig- 
nificant practical inefficiencies of the implementation ham- 
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pered observations. However, the AAOmega spectrograph 
was commissioned with 200 and 400 fibre nod-and-shuffle 
modes readily available. 

For nod-and-shuffle observations, the telescope is nod- 
ded back-and-forth between the target and adjacent blank 
sky at a frequency of 0.5-2 Hz. During the the telescope nod 
the charge accumulated on the CCD is transferred without 
loss (shuffled) to a storage area and a second region of the 
CCD is exposed to the sky flux now illuminating the fibre. 
This process is repeated a number of times before the CCD is 
readout. This mode allows quasi-simultaneous observations 
of source and sky, through an identical light path. Allocat- 
ing a pair of fibre to each source and nodding the source 
between each fibre in the pair mitigates the implicit 50% 
on-source duty cycle at the expense of target multiplex. 

As will be seen below fibre-optic nod-and-shuffle per- 
forms well. However, as well as significant practical ineffi- 
ciencies such as telescope slew-and-settle time and reduc- 
tion in target multiplex the nod-and-shuffle technique also 
introduces a significant reduction in the signal-to-noise ratio 
achieved, with respect to the dedicated sky fibre approach, 
for a finite observing time (see [J7j). 



2.3 Overview of Parkinson etal. PCA 
implementation 

The use of a PC A approach for sky s ubtraction was first 
demonstrated by iKurtz fc Minkl (|2000l ). although for that 
first implementation classical long-slit spectroscopic data 
was used to simulate the performance of the procedure. 
The first true test of th e PCA approach with fibre spec- 
troscopy was reported bvlWild fc Hewett 11200511 using data 
from SDSS. However, IWild fc HewettJ (|2005l ) focused on 
cleaning up relatively shallow spectroscopy. A goal of many 
fibre MOS projects has been to achieve more sensitive 
observations. Indeed, the WiggleZ Dark Energy survey 



Drinkwatcr et al. 2009) utilized a modified version of the 
Wild k. He wett 2005) PCA code to aid in the identifica- 
tion of faint (r(AB)<22.5) z > 0.5 emission- line galaxies 
whos characteristic emission lines are otherwise lost in the 
dense OH-forest at wavelengths beyond ~7000A. 

While early implementations have been successful, the 
PCA sky subtraction technique has seen little use over the 
past decade. There are numerous underlying reasons for 
this lack of interest in a potentially powerful technique: 

- Much early work was undertaken at relatively blue 
wavelengths where the OH sky-line forest is spares and lines 
are of only moderate intensity. With the advent of modern 
large format CCD system which retain high quantum 
efficiency out to 1 /im and beyond an important new 
spectroscopic window is opening up to fibre spectroscopy. 

- Slit systems report excellent results from accurate 
slit mapping in 2D, so the technique has not been required 
for long-slit or slit-mask spectroscopy system^ 2 ]. While this 
will remain true in the coming decade, there is no shortage 



2 However, systematics do remain, even after correction with a 
suitable slit function (commonly due to flexure in the spectro- 
graph). A detailed measurement with long duration observations 
has not been performed. 



of current or planned fibre MOS systems, largely due to 
the relative ease of implementation for fibre MOS systems 
accessing large fields of view. 

- Relatively bright targets have traditionally been 
observed with fibre based MOS systems. This is in part 
due to the historically somewhat marginal results that have 
been achieved on fainter sources due to poor control of 
systematic spectral defects in long duration exposures. It is 
these defects that the PCA process addresses. 

- Most fibre MOS systems report residual error esti- 
mates close to the Poisson residuals expected for routine 
observations and so there has been little need for improve- 
ment. We demonstrate that the application of the PCA ap- 
proach will allow significantly more difficult observations to 
become routine. 

The approach to PCA sky subtraction implemented 
for the AAOmega system at the AAT varies from the 
previous works and is described in detail in lParkinson etali 
(|in prep.1) . The implementation used in this work was a 
standalone processing script which ran on the reduced 
data products from the 2dfdr data reduction pipeline. 
The process has now been incorporated into a pipeline 
processing option within the body of the reduction code. A 
brief overview is given here: 

- Each target field is observed with ~400 science fibres. 
Of these, 20-25 are allocated to blank sky positions across 
the field to provide night sky spectra free from astronomical 
sources. 

- The data are processed in the usual manner using 
the 2df dr data reduction software. To perform a first order 
sky subtraction a nraster sky spectrunr is created from a 
median combination of the dedicated blank sky spectra. 
This spectrum is then scaled and subtracted from each 
science frame using the median flux in strong OH night sky 
lines to determine the required scaling. 

- Each valid spectrum (i.e. source and sky fibres 
are used, but bright calibration sources and unused fi- 
bres are discarded) is then continuum subtracted using 
a rolling median filter to remove large scale source structure. 

- A set of eigen vector spectra is calculated using a 
Principle Components Analysis for the ~400 sky subtracted 
spectra. 

- Each science spectrum is then sky subtracted using a 
linear sum of the thirty most significant principle compo- 
nents. 

The main de partures of the GAMA project PCA 
implementation of IParkinson et all (|in prep. I ) from that of 
IWild fc Hewettl (|2005l ) for data from SDSS are: 

- The eigen vectors and values are generated on a 
frame-by-frame basis, rather than from a large data base of 
spectra. This addresses many of the major sources of poor 
sky subtraction residuals (|A1.1|) . 

- Object and sky spectra are used to generate the 
components. This clearly breaks down if numerous objects 
have very strong structure at common wavelengths (e.g. 
M-star contamination of a galaxy survey) or if many similar 
objects are found within a given frame (i.e. during galactic 
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survey work). 

- Spectral features are not masked since they are not 
know a priori. Indeed, enhancing the detectability of such 
features within a strong foreground signal is the goal of this 
work. 

- With only ~400 input spectra for each frame, only 
a small number of eigenspectra can be use to perform the 
sky subtraction. In the work present here thirty compo- 
nents have been used. This number was selected for initial 
investigations as reported here by inspection of the result- 
ing spectra. A full investigation of the methodologies for 
parametrisation the eigenvector space from which compo- 
nents are drawn is underway. 

2.3.1 Continuum concerns 

Sky and source continuum presents a problem. In princi- 
ple the residual sky continuum should represent a separate 
set of distinct components in the eigenvector set produced 
by the PCA algorithm. In practise however, the residual 
OH and continuum signals and the source spectrum do not 
represent orthogonal basis sets and so intermixing between 
source spectral components in such data is inevitable. It will 
then not be possible to adequately represent any individual 
sky subtraction residual pattern using a reconstruction from 
a li mited set of the s e eige nvectors. 

Kurtz fc Minkl (2000) deal with the problem via a two 
stage fitting process which has the unfortunate side effect 
of invalidating the spectrum for m e asurem ents of the con- 
tinuum properties. IWild fc Hewettl IJ2005T ) rely on the fact 
that each SDSS spectrum has had a preliminary sky sub- 
traction to remove sky continuum components prior to their 
correction for OH line residual via the PCA. Object con- 
tinuum is removed via median filtering each spectrum in 
w avelength. In thi s regar d our approach is identical to that 
of I Wild fc Hewettl (J2005h . 

Ultimately it may be possible to assess the selection 
of eigenvectors and weighting based on those that preferen- 
tially contribute to residual correction in the blank-sky fibres 
spectra (perhaps requiring an increased number of dedicated 
sky spectra per observation). We have yet to investigated 
this aspect of problem. In short, we expect our approach 
to provide a reliable result for low intensity source spectra 
taken on moonless nights (so that the first order sky correc- 
tion is adequate). The primary interest should be focused 
on the red end of the spectrum where the error budget is 
dominated by OH subtraction errors and the source distri- 
bution should be such that individual spectra have little in 
common in the observed frame. A small number of high sig- 
nal, or repetitive spectra (such as calibration stars) should 
be clipped from the generation of the initial eigenvector set. 

For work in which absolute spectral fidelity at the high- 
est level is of paramount importance at the expense of sig- 
nificant survey efficiently - not typically the primary moti- 
vation in work such as a wide area galaxy redshift survey - 
one may require the nod- and- shuffle technique. 



2.4 Sky subtraction accuracy 

Example sky spectra as observed by the red arm of the 
AAOmega spectrograph are given in Fig. [T] Two horizontal 



levels are marked. These indicate the residual Poisson er- 
ror, expressed as a fraction of the local sky spectrum, which 
would be expected under the assumption of pure Poisson 
noise in the spectrum of a 1 hour sky observation. The ma- 
jority of the spectral region considered here is limited to a 
residual noise level between 1-3% of the sky signal in each 
spectral pixel. 

We require a metric with which to determine the sky 
subtraction accuracy achieved in any given observational 
data set. The figure of merit we use is the local error which 
we estimate as follows. 

A smoothed spectrum is constructed using a sliding 
square filter (with smoothing length of ±100 pixels) and 
computing a 5-ct clipped mean at each pixel. For each pixel 
the rms intensity within the smoothing window is taken as 
an estimate of the local sky subtraction accuracy. 

To confirm the validity of this error estimator a number 
of simulations are undertaken. Random realisation of noise 
spectra are generated with pixel intensities drawn from two 
distributions. The first is a Normal distribution with width 
set to that indicated by the variance array information 
which is generated for each spectral pixel during data 
reduction. A second realisation is generated from a normal 
distribution with width set by the local error estimate 
defined above. A thousand random realisation of each dis- 
tributions are generated for each spectral pixel and binned 
histograms of these model pixel intensities are compared 
to the pixel intensities for the observational spectra after 
subtraction of the smoothed spectrum. Inspection of the 
these residual error histograms indicates 

- The variance estimates from the data reduction process 
are an accurate description of the true noise properties of 
the data, with perhaps some small evidence for departure 
(likely due to small errors in the gain and CCD read-noise, 
N r d, estimation). 

- The local error estimator reliably reproduces the distribu- 
tion of observed pixel indensities and hence is a valid repre- 
sentation of the residual scatter in the data. 



3 EXAMPLE SPECTRA 

Sky subtraction accuracy is estimated for the experimental 
data sets detailed in [J2] using the three subtraction meth- 
ods presented in H2.4I The results are given in Tabled The 
individual data sets are discussed below. 



3.1 GAMA survey - moderate source intensity 
data 



Observations from the GAMA project (|Driver etali 12003 ; 
iBaldrv eta l. 2009) are shown in Fig. [5] The source presented 
has reported SDSS magnitudes of r(AB) = 19.17 and is found 
to be at redshifts of z=0.276. The spectrum is composed of 
3x1200 sec integrations, and was processed with the use of 
24 dedicated sky fibres to provide the master sky spectrum 
for subtraction. This master sky spectrum is shown scaled 
to 10% for comparision. While the sky subtraction does not 
hamper the identification of redshift for the source, there 
are systematic residuals in the data. 
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The lower panel in Fig. [2] shows the local error estimate 
as defined in § 12,41 The local error varies between close to the 
Poisson noise limit and over 10% of the local sky spectrum 
with an rms 4.67±0.6 over the full spectrum. 

In the absence of a Poisson component from the target 
source continuum, the local error estimate is consistent with 
the variance estimation derived from the pixel intensities 
during data reduction. 

The residual error histogram for the spectrum is shown 
in Fig. [3] confirming that the local error estimate is consis- 
tent with the residual error in the source spectra which is 
in turn broadly consistent with the spectral error expected 
from the data reduction pixel variance. 



3.2 WiggleZ data - sky limited data 

Data from the WiggleZ survey (jDrinkwater etal. 2009) is in 
a different regime from that of the GAM A survey. Since Wig- 
gleZ sources are 20 < r(AB) < 22.5 high equivalent width 
[O n] emission line sources there is little source continuum to 
introduce increased noise to the error statistic. Fig.Ushows 
such a WiggleZ survey source. The local error is close to 
the sky limited case (i.e. the limit expected from the pixel 
intensity variance estimates). 

Since WiggleZ spectra are largely free of continuum a 
stacking analysis of the all 352 science fibre spectra in this 
frame should produce an improvement in the local error es- 
timate which scales as vN for the number of fibres stacked. 
A clear departure from this scaling is seen in the upper panel 
of Fig. \S\ This failure to follow the \/N suppression is the 
result of systematic errors in the sky subtraction. 

On applying the PCA correction to the sky subtracted 
spectra and repeating the stacking analysis the local error 
estimate is seen to improve markedly. This control of the 
build-up of systematic error is easily seen in the residual 
error histogram of Fig. [5] Without the PCA correction the 
local error estimator is clearly very different from the ex- 
pected error distribution (based on pixel variance estimates 
from data reduction). With the application of the PCA cor- 
rection prior to stacking, the local error estimator and vari- 
ance data are largely in agreement. 



3.3 WiggleZ Deep - Nod-and-Shuffle data 

To compare the previous results to observations taken with 
the nod-and-shuffle observing technique we use deep obser- 
vations taken, for quality control purposes, as part of the 
WiggleZ project. A 6 hour on source observation is avail- 
able comprising three observations (taken over three nights) 
of 3x2400 sec. The data was observed in the Cross-Beam- 
Switching (CBS) mode with a pair of fibres allocated to each 
target. The observations where taken in the conventional 
AAOmega N+S mode with half of the available science fi- 
bres masked in order to provide the requisite CCD storage 
space (rather than the rece ntly implemen ted mini-shuffling 
mode discussed later and in lSharp I ((2009|)) . 

A representative spectrum for a typical source 
(r(AB)=22.26, z=0.851) is presented in Fig. [7] The local 
error estimator indicates that the observations are indeed 
close to the Poisson noise limit expected for the N+S ob- 
serving process. 



4 STABILITY WITH LONG DURATION 
EXPOSURES 

Observations of faint sources require long duration expo- 
sures to build up the required signal-to-noise ratio within 
each spectrum. In the limit of pure Poisson-noise in the pho- 
ton arrival rate, signal-to-noise is expected to build as the 
square-root of the observation time. However, systematic 
defects in the spectrum, most commonly from systematic 
failure of the sky subtraction, will reduce the efficiency of 
this accrual of signal-to-noise. The common causes of these 
systematic defects in fibre spectroscopy are considered in ap- 
pendix [XlT] Determining the point at which each sky sub- 
traction methodology becomes limited by systematic defects 
is key to selecting the most efficient observational strategy 
to achieve ones goals. 

In the absence of an ideal data set we mimic long du- 
ration exposures using two methods. First we consider the 
effects of stacking fibre spectra from within a single MOS 
frame. Secondly we investigate stacking individual fibres 
from across multiple independent observations. The former 
is not ideal as there may be strong correlated noise patterns 
between fibres in a single frame which would average out 
over many frames. The later is difficult in practise as a suit- 
able data set, which places a single fibre on blank sky across 
many independent observing frames, is not usually available. 

The residual error as a function of time for a single 
frame stacking process are shown in Fig. [8] Single frames 
from the GAMA, WiggleZ and WiggleZ N+S data sets pre- 
sented previously are used. Additionally, in order to con- 
struct a single fibre multi-frame stack we use the properties 
of the WiggleZ survey data to construct a stack of +200 
individual fibre spectra from data taken throughout 2009 
(Fig. |SJ. In each case stacked spectra are created with a 
range of values of N, the number of fibres used. For each 
value of N, 100 random realisations of the stacked spectrum 
are created and the local error and its rms scatter are plotted 
against the number of spectra used in the stack. Each stack 
of N fibres simulates an exposure of N hours on blank sky. 
The rate of reduction in residual error with time is then com- 
pared to the yN rate of decline expected from pure Poisson 
statistics. 

It is clear from Fig. |S]that for both the data sets which 
use purely the dedicated sky fibre subtraction, a systematic 
noise floor is reach after ~10 hours. With the application of 
the PCA procedure to the data prior to stacking there is 
no indication of this systematic noise floor until one reaches 
~ 100 hours of simulated observation. 

The N+S observations show little evidence for a noise 
floor at the limits of the available exposure time, however 
the residual error appears to decline at a rate slower than 
the VN Poisson limit, rather the rate of decline estimated 
from the data set is N" 038 . 

To investigate the decline rate further we perform a 
single fibre stacking across all 219 observations available 
from the 2009 WiggleZ project observing program at the 
AAT. The results of this long duration exposure stacking 
are shown in Fig. [9] No explicit data quality control has 
been performed (although only data accepted for use in the 
survey has been used) and a month-to-month re-binning of 
the data was require to place data on a common wavelength 
scale. Local error estimates are shown for random realisa- 
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tions of the stacked data as before. Stacks are created for 
the data both with and without the PCA sky subtraction. 
In order to extend the experiment beyond the 219 hours of 
data available using a single fibre from each observational 
frame we also repeat the experiment using 50 fibres from 
each frame, which pushes the effective exposure time to be- 
yond 10,000 hours. 

It is clear that the dedicated sky fibre only sky subtrac- 
tion again reaches a noise floor beyond which sky subtraction 
is not improved. This occurs at a greater time than for the 
single frame experiment in Fig. [8] likely due to the presence 
of stronger correlated errors when using data from a single 
science frame. The stacked data which has been processed 
with the PCA procedure continues to show the steady de- 
cline in local residual error with no compelling evidence for 
a systematic noise floor. The rate of decline in the local er- 
ror is however not N~ 0,5 as would be expected from purely 
Poisson noise considerations, but rather N -0 ' 32 , closer to the 
cubed root of exposure time. There is of course a small con- 
tribution to the Poisson noise from the intrinsic source flux 
in each spectrum which, while small and randomised over 
redshift in this experiment, is none zero. No comparable du- 
ration sky-only data set is available with which to avoid this 
contribution at this time. 



4.1 Comparison with N+S survey observations 

The long duration experimental results from Fig. [5] indicate 
that the PCA sky subtraction process applied to dedicated 
sky fibres observational data produces excellent results. We 
now ask how these results compare to those obtained with 
the N+S process. As stated previously, the N+S process 
requires longer individual integrations to reach a common 
signal-to-noise level in the data. If single fibres are allocated 
to each science target, the N+S observing procedure requires 
x4 the integration to reach the same signal-to-noise level 
as a dedicated sky fibre observation (due to the 50% duty 
cycle on source while still retaining the full 100% of the 
noise budget from the sky spectrum in both the A and B 
components of the nod observation). If a pair of fibres is 
allocated to each target and cross-beam-switching (CBS) is 
used, then this becomes only x2 in exposure time. We shall 
for the moment ignore the reduce targeting efficiency implied 
by allocating two spectroscopic fibres to each target. 

Since N+S observations require this extend observation 
time with respect to the dedicated sky fibre observations, we 
should therefore compare the local residual error estimate of 
the N+S observation with a dedicated sky fibre and PCA sky 
subtraction observation of the equivalent overall exposure 
time, in order to determine the optimal observation strategy. 
This comparison is made in Fig. 1101 

For short duration exposures, the local error estimate 
for dedicated sky fibre data and for N+S data is comparable, 
and consistent with the Poisson limit (as demonstrated in 
Figs. I2I7|) . For the PCA processed observation, we assume 
the r -0,32 rate of reduction in local error estimated from 
Fig. [5] For the N+S data, we apply the x4 (x2 if using 
CBS fibre pairs at the expense of raw target numbers per 
observation) increase in exposure time to two rates of re- 
duction in the residual error. Firstly we use the theoretical 
value of N -0 ' 5 , secondly we use the slower rate of N -0 ' 38 ob- 
served in Fig. [8] We wish to find the exposure time at which 



the local error is lower with increasing exposure time when 
using the N+S observation strategjQ These values can be 
calculated from the assumed gradients as shown in Fig. 1101 
and are found to be 47 hours (6502 hours) without the CBS 
fibre pairs assuming the -0.5 (and -0.38) power of decline 
and 7 hours (81 hours) if CBS fibre pairs are used. 



5 VISUAL COMPARISON OF SPECTRAL 
QUALITY 

To illustrate the spectral quality of the various sky subtrac- 
tion methodologies we present the composite spectra, from 
the stacking analysis using each of the processing method- 
ologyies, in Fig. 1111 The indicated number of spectra are 
combined (using a clipped mean) after subtracting a smooth 
continuum from each. The modulus of the residual spectrum 
is shown in each case, overlayed with the error array derived 
from the pixel variance estimates during data reduction. In 
each case four data sets are shown. In the lower spectrum we 
see the 3x1200 sec observation using the dedicated sky fibres 
method. The second spectrum is the same frame but with 
the PCA process applied to the data prior to stacking. The 
third and forth spectra are the N+S observations, the third 
does not implement CBS fibre pairs while the upper spec- 
trum does. The N+S frame was composed of 3x2400 sec ob- 
servations and so the CBS frame contains 2 hours on source 
data, while the the other contains only 1 hour. 

For the dedicated sky fibre observations the residual 
error is significantly in excess of that expected from the 
data reduction process and increasingly so as more spec- 
tra are combined (in agreement with Fig. |SJ. The N+S data 
is broadly consistent with the variance estimation but as 
is expected the level of this residual error is higher due to 
the increased sky component of the N+S observations. The 
frame processed with the PCA sky subtraction is visually by 
far the cleanest spectrum, supporting our previous analysis. 
It gives the greatest signal-to-noise data with the lowest sky 
residual for the shortest on sky integration time. Further- 
more it has considerably smaller observational overheads in 
comparison to the N+S observations and maximises the fibre 
multiplex since there is no need for cross-beam-switching. 

An alternative assessment of the data quality is pre- 
sented in Fig. [T2] Here we create stacked spectra from the 
219 WiggleZ observations from 2009. In each case the spec- 
trum is presented with and without the PCA sky subtrac- 
tion. Insufficient N+S observations are available to generate 
the comparison spectrum at this time. The suppression of 
systematic residuals in the spectra is clearly visible in the 
PCA processed data. No continuum subtraction was applied 
to the fibres in Fig. [12] and so the weak source continuum 
from the high equivalent width emission line targets of the 
WiggleZ survey is visible in the averaged spectrum. 



3 Clearly if the true rate of error reduction for N+S observations 
is identical to that of the PCA observations, N -0,32 , this situation 
is never achieved. 
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6 QUANTITATIVE SURVEY ANALYSIS 

The most apparent benefit of the PCA analysis to the user, 
for faint-object spectroscopy, is the reduction in systematic 
cosmetic defects. In order to provide a quantitative assess- 
ment of the improvement a detailed simulation has been 
undertaken. AAOmega survey data was selected from six 
clear dark nights from the September 2008 observing cam- 
paign for the WiggleZ project. This provided 85x1200 sec 
science frames. As discussed previously, WiggleZ targets are 
primarily high equivalent width faint (r(AB)~22) emission- 
line sources hence each of the ~354 science fibres largely rep- 
resents blank sky observation. Additionally, each date frame 
contains 25 fibre allocated to preselected blank sky positions 
which are free from the low level Poisson noise contribution 
from the faint continuum still present in fibres allocated to 
WiggleZ science targets. In what follows the blank sky fibre 
provide the idealised test data set while the science fibres 
observations are used to provide a larger sample size. 

Each CCD frame was processed using the 2df dr soft- 
ware environment and the standard data reduction recipes. 
For each individual reduced data frames (1200 sec integra- 
tion) a synthetic spectrum was generated from a template 
early type galaxy. The galaxy template used was the in- 
term ediate age early typ e spectrum from the GDDS sur- 
vey l| Abraham etal.1 12004 ). The galaxies where distributed 
in redshift between 0.7<z<1.3 which is the limit imposed 
by the rest frame spectral data matched to the observed 
spectral range. This redshift range see the Calcium H+K 
absorption lines and the Balmer break transit the strongest 
OH sky-line regions in the red AAOmega spectrum. Spectra 
where generated for a series of source intensities scaled to 
match count rates equivalent to apparent J (Vega) =20-24, 
with the appropriate target Poisson noise included. 

The PCA sky subtraction was then applied to each re- 
duced science frame and image stacks where created at each 
simulated magnitude level with and without the PCA cor- 
rection. A redshift analysis was undertaken with the fully 
automated mode of the runz cross-correlation code typically 
used with data from the 2dF/AAOmega system. The fully 
automated analysis was chosen in preference to the more 
typical semi-interactive methods more common to faint ob- 
j ect p r ograms und e rtake n with AAOmega (e.g. IWake et al.l 
(2006); IRqss et al] {2006)) to avoid subjective results. For 
this idealised test case, only a single cross correlation tem- 
plate was used, the same spectrum used in the generation of 
the mock data. The prior for the redshift distribution was 
match to that of the mock test data input range. 

The redshift analysis results are shown in Fig. 1131 
The upper panel indicates the measured redshift returned 
by RUNZ as a function of input model redshift for the 
/(Vega)=21 simulated data set for the full 11.3 hour 
(85 frame). With the PCA processed data we recover the 
correct redshift for 85% of simulated sources. The misidenti- 
fied spectra are preferentially at the highest redshifts, where 
the /-band normalisation results in the lowest signal-to-noise 
ratio spectra. In the absence of the PCA sky correction, the 
success rate is reduced to 25% with a number of distinct 
failure modes clearly present. 

A comparison of the model spectra derived from blank 
sky fibre spectra rather than those containing WiggleZ sci- 
ence targets demonstrates that the residual noise contribu- 



tion from the faint WiggleZ targets present in the science 
spectra used for this analysis does contribute a reduced effi- 
ciency for the test and will therefore contribute to the slope 
measured for the sky error reduction rate seen in Fig. [9] 

The lower panel of Fig. [13] shows the declining recov- 
ery rate for fainter model spectra. At fainter input mag- 
nitudes, the fully automated RUNZ code, which was not 
specifically tuned for such a regime, does remarkably well 
even at 1=23. With the planed upgrade of the red arm of 
AAOmega to a new generation high-resistivity CCD (boost- 
ing quantum efficiency in the range 800nm-l/im by a fac- 
tor of four) it will be possible for AAOmega to continue to 
conduct efficent wide field surveys for earl y type galaxies 
{Cannon. Drinkwater. Eisenstien etal\ \2006) out to at red- 
shift z~l. 



7 CONCLUSION 

We have demonstrated that the use of a Principle Com- 
ponents Analysis sky subtraction with dedicated sky fi- 
bres obse rvations in a man ner similar to tha t pre - 
scribed bv lKurtz fc Minkl {2000] ) and I Wild fc Hewed {20051 ) 
(and subsequen t ly imple mented for the GAMA survey by 
IParkinson etali {in prep.1 )). provides excellent quality astro- 
nomical spectra over long duration exposures. For moder- 
ate length exposures (10-100 hours on source) the spectral 
quality is comparable to that obtained using the N+S ob- 
servational technique, and likely for significantly longer ex- 
posures. We therefore conclude that for fibre based multi- 
object spectroscopy, in instances were high multiplex and 
observing efficiency are critical, the PCA approach is supe- 
rior to nod-and-shuffle based techniques for moderate signal- 
to-noise extragalactic survey spectroscopy. 
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APPENDIX A: SKY SUBTRACTION 
METHODOLOGIES 

The limiting sensitivity of a spectral observing system is ul- 
timately a question of the signal-to-noise ratio achievable for 
an given observation. In the faint source limit the Poisson- 
noise component pertaining to the source photon arrival 
rate can be largely ignored. Additionally one may assume 
an observational strategy has been adopted such that noise 
injected by the detector system (read-noise) may be dis- 
counted. In what follows we shall also assume no signifi- 
cant contribution from a thermal or scattered light back- 
ground. Under these circumstances, at wavelengths in the 
range 700nm-1.7/im the limiting factor for system sensi- 
tivity becomes the Poisson-noise associated with the fore- 
ground screen of emission from the earths atmosphere. On 
a moonless night, the dominant component of this emission 
is the molecular emission bands from upper atmosphere OH 
radicals, with an additional contributions from bands due to 
2 (Ellis & Bland-Hawthorn 2008). An example of this OH- 
airglow spectrum, as seen by the red arm of the AAOmega 
spectrograph, is shown in Fig. [T] We will return to the trou- 
blesome effects of scattered moon light later. 

While this sky signature may be subtracted from science 
spectra, the random noise component remains and can be 
suppressed only through the collection of a greater quantity 
of photons, be that via a larger telescope collecting area or 
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Figure 2. An object spectrum from the GAMA survey is shown 
to illustrate the sky subtraction accuracy for moderatly bright 
sources using the dedicateted sky subtrcation technique. 
The upper spectrum is composed of 3x1200 sec exposures of 
1 hour on source using 24 dedicated sky fibres to construct the 
master sky spectrum used for sky subtraction. The emission line 
source shown (Ha/[Nll]/[Sll]) is at a redshift of z=0.276. The 
source catalogue magnitude is r(AB)=19.17. The plotting range 
is chosen to show regions of strong atmospheric OH and O2 emis- 
sion, sky line free continuum regions and two Fraunhofer telluric 
absorption bands. 

The lower plot shows the local error in the spectrum, estimated 
as discribed in § 12.41 The error is presented as a percentage of the 
local sky spectrum at each wavelength. The Poisson limit deter- 
mined from pixel varaince array during data reduciton is indicted 
by the thick line. The Poisson limit is approached in the vicinity 
of the strong emission lines. Between the emission features, the 
residual is dominated by Poisson noise from the source spectrum. 



an extended exposure timqj. In the sky-noise limit increased 
depth is gained only at the square root of the increase in ei- 
ther. In the presence of even subtle systematic defects in the 
recorded spectrum of a faint target, increasing the exposure 
time without end will ultimately reach a noise floor limited 
by the presence of systematics (Figs. [SJ&O. The depth at- 
tainable with any observing system is therefore limited by 
ones control of sources of systematic error. 

Since fib res were fi rst used for astronomical spec- 
troscopy (see iHilll (|l9881 ) for an historical perspective) a 
number of strategies have been implemented to attempt to 
remove the sky signature which is signed, in what often ap- 



An important caveat being of course that increasing the 
source:sky photon ratio via the use of a smaller aperture coupled 
to improved image quality has an important effect. 
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Figure 1. Representative sky spectrum are shown taken from the red arm of the AAOmcga spectrograph. 

Upper panels - The 5700A and 6700A dichroic beam-splitters have been used with the 385R VPH diffraction grating to cover the 
wavelength range 6000A < A < 9800A. The strong OH air-glow lines and the unresolved emission band of O2 are evident. The spectra 
are gain and exposures time corrected to units of electrons per hour. They are not flux calibrated and so line relative intensities are 
subject to the instrument response functions. Horizontal lines marks the 1% & 3% sky subtraction accuracy limits from Poisson noise. 
Lower panels - Left) The overlap region of the two sky spectra are compared. The data were taken two weeks apart and a significant 
variation in the OH sky-line scaling is evident. The WiggleZ survey spectrum has been scaled by a factor of 1.2 to match the O2 band, 
but the relative intensity of the OH lines remains higher in the April data. Right) The estimated residual error levels (1-cr assuming 
purely Poisson noise dominated by the sky emission level) for the two sky spectra are also shown, along with the expected contribution 
from CCD readout noise assuming three independent 1200 sec exposures. 



Table 1. Details of the example AAOmega data sets used in this analysis. 



Sky subtraction mode 


Program 


Exp. details 


Observation Date 


Dedicated sky fibres 


GAMA survev (Driver et al. 2009) 


3 X 1200 sec 
3 X 1200 sec 


27 March 2008 


Dedicated sky fibres 


WiggleZ survey (Drinkwatcr et al. 2009) 


25 March 2008 


Nod-and-Shuffie 


WiggleZ survey (Drinkwatcr et al. 2009) 


3 X 2400 sec X 3 nights 


21, 25 & 26 September 2008 


Extended PCA test 


WiggleZ survey (Drinkwatcr et al. 2009) 


3 x 1200 sec 


2009 



pears to be indelible ink, across every spectrum. A number 
of standard strategies have evolved over time; we discuss 
each in turn below. 



Al Dedicated sky fibres 

The most common method for sky subtraction with fibre 
multi-object spectrographs is to utilise dedicated sky fibres. 
In this model, a number of fibres are allocated to previously 
determined blank sky positions across the field of view. This 
allows the observation of the sky spectrum simultaneously 



with the science observation removing the effects of temporal 
variations in the night-sky spectrum. 

These sky spectra are processed alongside the science 
data and combined to provide a photon-noise suppressed sky 
spectrum (benefiting from the \/n noise suppression from 
multiple independent samplings of the sky) . This combina- 
tion creates a master sky spectrum effectively noise free by 
virtue of a high signal-to-noise with respect to the science 
observations. To achieve this noise free sky spectrum, one 
must typically combine 15-30 individual spectra. The exact 
nature of the sum depending on the observational require- 
ments, and factors such as the cosmic ray rate at the detector 
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Figure 8. In the absence of a large number of independant data sets, long integration times can be simulated by stacking the spectra 
from multiple fibres within a data set. Local residual error is plotted (as a percentage of the local sky intensity) as a function of the 
number of fibres stacked in the three dats sets considered. The mean value ±3-cr rms scatter is shown for 100 random realisations of the 
stacked spectrum with each value of Nfibres. Only ~400 fibres are avalibale in the GAMA and WiggleZ frames. This becomes ~200 for 
the N+S observations. The solid line traces the vN decay expected from pure Poisson noise in the data. 



and the care with which the user has chosen blank sky re- 
gions (a non trivial problem for observation in some fields 
e.g. the galactic bulge). 

While the use of the dedicated sky fibres prevents the 
act of subtracting the master sky spectrum increasing the 
random noise (since the master sky spectrum is essentially 
noise free), the subtraction accuracy is at the mercy of a 
series of systematic defects as outlined below. These sys- 



tematic effects limit the ultimate depth of observations with 
the dedicated sky subtraction method, shifting the signal- 
to-noise firmly off the \fr curve-of-growth one would expect 
from a purely Poisson process of photon arrival from the 
target and night sky. The number of fibres one should dedi- 
cate to sky is a topic of debate for many a survey program. 
Clearly each fibre used for blank sky is unavailable for ob- 
servations of a science target object and so directly reduces 
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Table 2. Summary of sky subtraction accuracy results. For each data set, the predicted 5-cr (per spectral resolution element) ij-band 
magnitude limits is indiacted 



Data set 



R. Poisson error Local error 

mag(Vega) % sky % sky 



Exposure time details 



GAMA 
GAMA-PCA 




22 


3.53±1.17 
3.53±1.17 


4.67±0.60 
3.76±0.46 


lh on source (lh effective sky) 


WiggleZ 
WiggleZ-PCA 




22 


3.28±1.34 

3.28±1.34 


3.57±0.38 
3.01±0.29 


lh on source (lh effective sky) 


WiggleZ Deep 
WiggleZ Deep 
WiggleZ Deep 


CBS 
Full 


21.25 
21.65 
22.25 


2.32±0.95 
1.64±0.67 
0.95±0.40 


1.80±0.13 

1.27±0.10 
0.87±0.07 


2h sky, lh on source (2h effective sky) 
2h sky, 2h on source (4h effective sky) 
6h sky, 6h on source (12h effective sky) 



Note: for the N+S data, the sky is effectivly twice as strong, as there is a noise contribution from both the on and off observing 
positions. For the CBS frames the sky is still double the open shutter time, however the on source exposure time is also the full open 

shutter time. 
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Figure 3. The residual error per pixel is assessed for the spec- 
trum presented in Fig. [2] The local error estimate reproduces the 
distribution of data values well. 



the multiplex advantage. For AAOmega observations at the 
AAT 20-25 fibres are typically allocated to blank sky in a 
given configuration, ~5% of the full complement of available 
science fibres. It should be noted that this order of overhead 
will likely remain for systems with many more fibres than 
AAOmega. The available fibres of such a system will likely 
be separated across multiple spectrographs and for reasons 
of PSF stability (outlined below) 15-25 fibres will likely be 
needed per spectrograph. 

The master sky spectrum is subtracted from all object 
spectra after an appropriate scaling factor is derived to ac- 
count for fibre relative response variations. Standard dome 
flat frames typically cannot be used to determine this rel- 
ative response since it is not usually feasible to provide a 
uniform intensity illumination to all fibres across the field- 
of-view of the observing system. For example the quartz- 
halogen flat field lamp frames used with AAOmega observa- 
tions provide an excellent wavelength dependant relative re- 
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Figure 4. Fig.[2]is repeated for a source from the WiggleZ survey. 
WiggleZ targets are selected to be r(AB)>21 emission line galax- 
ies and so each spectrum is free from contamination from source 
continuum in most pixels. The source shown has r(AB)=21.63 
and an emission line redshifts of z=0.6117. The spectrum is a 
3x 1200 sec observation. The lack of significant objects continuum 
accounts for the improved local error estimates for the WiggleZ 
spectra over that shown in Fig. [2] The local error in the spec- 
trum approached the Poisson limit in the vicinity of strong OH 
emission features. 



sponse fibre-to-fibre, but provide no total intensity normali- 
sation due to a strong illumination gradient in the flat field. 
This illumination gradient results from the short path length 
(~2m) between the light source and the diffuser screen. For 
low resolution sky limited AAOmega observations this scal- 
ing is usually derived from measurements of the very night 
sky air-glow lines one aims to subtract from within the sky 
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Figure 5. Since the majority of targets from the WigglcZ sur- 
vey possess little continuum a stacking analysis can be performed 
using all 352 science fibres from this frame. This is shown in the 
upper pane. The presence of systematic defects is evident in the 
rate of residual error reduction when comparied to the error level 
expected from the variance arrays of the combined spectrum. 
In the lower panel the stacking process is repeated after applying 
the PCA correction to the data. The local error is largely indistin- 
guisable from the Poisson error in this vissual representation. The 
associated residual error histograms (ij |2,4J l are given in Fig. [6] 



and science target spectra. The master sky spectrum is then 
scaled to match the measured line intensities for subtraction. 



Al.l Problems associated with dedicated sky fibre 
observations 

There are a number of issues which limit the sky subtrac- 
tion accuracy typically achieved with dedicated sky fibres. 
The first five problems considered below amount to system- 
atic modification of the individual input spectra via cross- 
contamination or the addition of a non-astronomical com- 
ponent which largely invalidates the dedicated sky fibres ap- 
proach. The remainder are concerned with the introduction 
of small scale variations in the apparent profile of fibre im- 
ages on the detector, the Point Spread Function (PSF). 
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Figure 6. The residual error histrograms for the stacking analysis 
of for all 352 WiggleZ spectra from Fig. [5] The first is that for 
the 352 spectrum stack without the PCA analysis. The second is 
for the same data set but with the PCA sky subtraction applied 
to the data prior to stacking. The PCA process suppresses of the 
systematic error component and the local error estimate is much 
closer to that expected from the spectral variance estiamtcs. 



exposure(j The first order effect of strong moon-light gra- 
dients in the field will be that line and continuum spectral 
components scale differently across the field and cannot be 
adequately accounted for using a single scaling derived from 
a single component master spectrum. 



Continuum variations 

While there is little evidence on small angular scales for 
structure in the night sky OH emission, at least at wave- 
lengths shorter than < 1 um, gradients in the continuum 
emission particularly during bright- of -moon do significantly 
modify the observed sky spectrum as a function of field po- 
sition (certainly for instruments with a wide field-of-view 
such as the -k deg 2 of the AAOmega system) . Indeed for very 
wide fields of view, spatial variations in OH emission inten- 
sities may not in fact average out over the duration of an 



CCD fibre packing 

Perhaps the most mundane effect one must account for is 
simply the fibre packing onto the CCD. With CCD real- 
estate at a premium, there is an ever present temptation 



5 Scattered moon light from a poorly baffled telescope super- 
structure would of course be particularly damaging. It would be 
almost impossible to correct for such an additive systematic error, 
since it will be highly variable both spatially and temporally. 
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Figure 7. The full 6 hour WiggleZ N+S observation of an emis- 
sion line source is shown. The source has r(AB)=22.26 and the 
redshift indicated by the [On] emission line is z=0.851. Using 
a CBS fibre pair the observation comprises 6 hours on sky and 
6 hours on source. 



10.0 r 



TT r 



TT r 



o Dedicated sky 

• Dedicated sky (x50) 

□ PCA 



1.0- 



0.1 



0.1 




10.0 100.0 

N frames (N hours) 



1000.0 10000.0 



1.0 



0.1 



:\ 




: 


■ \ 




- 


\ 






\ 






" ^ 




. 


v^ 






\ 


\ NfflSg 


\ I - 


N-°- 5 
- N-°- 32 


\ 


r^^ 


_ ^^m N+S CBS 




\ ^Ifffligkv ' 


ZZZZZN+S 




\ \[n^^^x i 









0.1 



1.0 



10.0 100.0 1000.0 10000.0 

N frames (N hours) 



Figure 10. The sky residual decline rate with incearsed exposure 
time, for the PCA sky subtraction, is reproduced from Fig. [9] Two 
additional zones are added to the figure. Each zone shows the sky 
subtraction accuracy expected for N+S observations assuming 
slopes of N -05 (the theoretical limit) and N — °- 38 (that observed 
in the observational test data presented in figure Fig. [8] The two 
zones have been shifted along the horizontal axis (N frames, or 
rather exposures time) by the multiplier required over standard 
dedicated sky fibres observations in order to achive the same fi- 
nal signal-to-noise ratio in the spectrum (with and witout CBS, 
i.e. x2 or x4). It is not until the green (PCA) line crosses these 
two boundaries that N+S observations become more efficent than 
dedicated sky firbes observations. 

The cross-over points indicated are 47 hours (6502 hours) for No 
CBS with the slope of N" ' 5 (N" - 38 ). This becomes 7 hours 
(81 hours) if CBS is implemented. 



to push optical fibres closer and closer together in order 
to increase the multiplex for a given detector area. For the 
AAOmega system at the AAT, the fibre pitch (separation 
between adjacent fibres) is ~10 pixels, with the fibre profiles 
projecting an approximately Gaussian FWHM~3. 4 pixels. 
A tighter pitch results in significant overlap (cross talk) be- 
tween adjacent fibres. Close packing hampers scattered light 
(background) estimation and is particularly problematic in 
the high contrast calibration frames such as flat fields, where 
one requires high intensities to reduce Poisson noise sensi- 
tivity, but which may compromise accuracy by inducing sig- 
nificant cross-talk errors between fibres. Improved extrac- 
tion methodologies, which better a ccount for fibre-to-fibre 
cross-contamination, are invaluable IjSharp fe Birchallll2010l ; 
iBolton fc Schlegelll2010l ). 



Figure 9. Some 219 independant frames are available from the 
2009 WiggleZ observing campaign with which to undertake a 
stacking analysis. The hollow points are constructed by combin- 
ing a single fibre from each of a N frames (±3-ct rms scatter). The 
filled points simulate a much longer exposure by including 50 fi- 
bres from each of N frames when making the stacked spectrum. 
A dashed line marks the VN rate of decline. 



Flat field irregularities and scattered light 

Imperfect flat fielding within the spectrograph will introduce 
errors. A common problem is imperfect removal of scattered 
light on the detector system either from a poor account- 
ing for the wings of individual fibre profiles, or from low 
level diffuse scattering and ghost events from the myriad 
optical elements within the spectrograph light path (strong 
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Figure 11. Stacked spectra are created using the three sky sub- 
taction methodologies under consideration. The moduls of the 
residual error in the stacked (continuum subtracted) spectra is 
shown, along with the error estimate derived from variance array 
propergation during data reduciton. Vertical lines mark strong 
Telluric bands which are not correctly addressd by the sky sub- 
traction process (the required multiplicative correction was not 
performed in this case). 



ghosts being of such grave consequence that they have been 
removed by adequate baffling during the detailed spectro- 
graph design process of course). Accurate removal of scat- 
tered light from science and calibration frames, particularly 
high signal level flat field frames, is es sential for accurate 
sky subtraction (|Sharp fc BirchallbOlCf ). 



Pixel-to-pixel sensitivity variations 

Pixel-to-pixel sensitivity variations must be considered care- 
fully. Any high frequency variations in the final image will 
distort the PSF of individual OH lines within the sky spec- 
trum and modulate the sky subtraction accuracy. Ideally 
pixel-to-pixel sensitivity variations should be multiplicative 
corrected via a long-slit flat field process whereby a uniform 
(both in wavelength and spatial intensity variations) source 
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Figure 12. Long duration exposures are generated across the 
219 frames available from the 2009 WiggleZ observing campaign. 
The indicated number of spectra are combined across a number of 
independent frame. Since the fibres used were placed on different 
sources during each observation, there is no coherent object signal 
present. In each case the lower spectrum shows the resulting spec- 
trum without the PCA sky subtraction. The correction has been 
applied to the components of the upper spectra before stacking, 
eliminating the systematic error residuals. The error array from 
the data reduction process is also shown. No continuum subtrac- 
tion was applied to the spectra prior to stacking and hence the 
low level composite object continuum is visible. The PCA pro- 
cessed spectrum shows significantly smaller systematic residual 
errors, consistent with Figs. [8] & [9] 



illuminates the detector and local intensity variations are 
identified through dividing such a suitably exposed frame by 
itself after moderate smoothing with a low pass spatial filter. 
For many observing systems the small amplitude of varia- 
tions found in the current generation of (low fringing) CCDs 
appear adequately corrected by the fibre-flat-field process 
without resorting to external long-slit flat fields. Fringing in 
the CCD, due to variations in the thickness of the device, is 
a related issue and for a spectroscopic system can essentialy 
be treated as part of any pixel-to-pixel sensitivity correc- 
tion. For the AAOmega system the effect of using a long-slit 
flat field to correct for pixel-to-pixel sensitivity variations is 
found to be measurable but not critically significant. 



Fibre fringing 

Any small air-gap between optical surfaces in the fibre feed 
will introduce an etalon effect and produce interference 
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Figure 13. The rate of successful redshift determination is com- 
pared, with and without the PCA sky subtraction, using a mock 
data set generated from 11.3 hours of data (85x1200 sec frames) 
from the WiggleZ survey. 

Upper panel) The redshifts recovered by the RUNZ code in fully 
automated mode are shown as a function of the input model red- 
shift. Open circles show the results without the PCA correction, 
filled symbols include the improved sky correction. The data set 
shown is for synthetic spectra with /(Vega)=21. 
Lower panel) The rate of correct redshift identification is shown 
as a function on the input model magnitude. The simulations are 
performed using the 354 science fibres, and also the 25 sky fibres 
available across each frame. 



fringing in the observed spectrum of a fibre. If this air-gap 
remains stable over time, then the fringing amplitude mod- 
ulation will be corrected via the flat-fielding process and 
amounts only to an, undesirable, loss of observing efficiently. 
However, if the air-gap is unstable with time the frequency 
of the interference fringing will change and flat-fielding will 



no longer correct the signal modulation. In this case sky sub- 
traction through the use of a master sky spectrum is doomed 
to failure since the relative intensities of sky lines are modu- 
lated by the fringing. Fibres that exhibit fringing are there- 
fore of limited value for spectroscopy. Careful preparation 
of all air/glass surfaces, and where possible immersion of 
fibre faces, is essential to guard against these catastrophic 
spectral defects. Fibre fringing has been a perennial prob- 
lem in a small fraction of AAOmega fibres. The root cause 
has been traced to differential thermal expansion properties 
of materials within each 2dF fibre button and a mechanical 
solution, removing the differential expansion, will shortly be 
implemented. 



Optical system PSF variations 

Point Spread Function (PSF) variations within the spectro- 
graph optics are a serious limitation to the accuracy of sky 
subtraction. The PSF, a measure of the projected image 
quality at each point in the spectroscopic camera, will typi- 
cally have a radial dependence due to the camera optics, as 
well as a wavelength variation. This results in a fibre-to-fibre 
wavelength dependant variation in the resolution element 
across the detector. Coupling this with manufacturing tol- 
erances on fibre diameter (typically a few percent variation 
between fibres from different batchs, with somewhat smaller 
variations from fibres constructed from a single draw) one 
realises that no two fibres will project spectra with identical 
resolution characteristics, and hence OH line profiles. 



Wavelength calibration 

More mundane but related issues regarding the fidelity of the 
wavelength calibration introduce further complexity. Typi- 
cally one might expect to calibrate a well exposed arc lamp 
spectrum to an accuracy of the order 0.1 pixel, but variations 
in external factors such as the /-ratio at which arc light is 
fed into the fibres for calibration can reduce the internal 
relative accuracy of this calibration to a point that compro- 
mises sky subtraction, even if the global external solution is 
within tolerance. 

The limited spectral (and spatial during the extraction 
process) sampling of profiles compounds these errors once 
interpolation of spectra onto a common working coordinate 
scale becomes necessary to combine individual sky spectra 
or to perform the relative baseline shift required to subtract 
the master sky spectrum from the observed data (methods 
which limit or avoid this re-sampling are possible (e.g. the 
b-spline fitting methodology use d extensivly in the SPS S 
pipeline processing software; see iBolton fc Buries! IJ2007I )). 
Increasing the PSF sampling presents the usual trade-offs of 
adversely effecting signal-to-noise and spectral coverage. 



Focal ratio degradation 

Practical construction considerations also have an impact 
on performance. In slit mask systems, the impact of slit 
roughness (primarily burrs from the manufacturing process, 
and non uniform slit width etc.) have long been known and 
are dealt with via the use of slit function calibrations, usu- 
ally from twilight flat field frames. One candidate for the 
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fibre equivalent of slit roughness is Focal Ratio Degradation 
(FRD). For the multi-mode optical fibres commonly used for 
astronomy, FRD results in a broadening of the angular dis- 
tribution of light as it emerges from the fibre with respect to 
the distribution with which it was fed in. This effect mod- 
ifies the width of the fibre profile, changing the resolution 
element, and more insidiously modifies the projected profile 
shape of the fibre. Hence the myriad sky lines in a set of sky 
spectra will have subtly different shapes at the spectrograph 
due to FRD. Careful controls during the preparation of the 
fibres as part of the fibre feed construction can reduce the 
impact of FRD and improve throughput by preventing FRD 
losses from over filled optical elements in the system (result- 
ing from poor accounting for changes in the focal ratio due 
to FRD). 

A 1.2 p-Cygni profiles 

The practical upshot of most of these limitations is a ten- 
dency for sky subtraction residuals to present the classic 
p-Cygni line profile at each and every night sky emission fea- 
ture. A slight mismatch between the master sky spectrum 
and the individual sky spectra within each object spectrum 
leads to over and under subtraction of the sky lines as a 
function of wavelength. 

A number of strategies exist for minimising the p-Cygni 
(or reverse p-Cygni) residual profiles. For example one may 
benefit from employing a secondary wavelength calibration 
derived from the air-glow lines themselves. Typically this 
will not appreciably modify the wavelength solution of any 
given spectrum, but can correct for small irregularities at 
the fraction of a pixel level. For the AAOmega system, an 
iterative optimal sky subtraction routine was adopted within 
the 2df dr data reduction packagqj. Under this scheme, the 
master sky spectrum is iteratively subtracted from each ob- 
ject spectrum and the residual assessed while fitting for line 
scaling, small velocity offsets (at the sub-pixel level) between 
the template and the data, and employing a smoothing filter 
to account for PSF variations; 

With more data at hand. I Wild fc HewettJ (120051 ) imple- 
mented a Principle Components Analysis (PCA) approach 
to the problem of correction of poor PSF re sidual sky sub- 
tractio n. This methodology, first proposed bv lKurtz fc Minkl 
(2000), has been successfully implemente d with AAOmega 
data at the AAT by both the Wig gleZ (Drink water etali 
120091 ) and GAMA JDriver et aZ.ll2009l ) survey projects. It has 
been adopted as an option within the AAOmega data reduc- 
tion environment for moderate signal-to-noise extragalactic 
spectroscopy!^ 



A2 Nod-and-shuffle 

The nod-and-shuffle (N+S ) observing technique was first 
deploy ed at the AAT by Idazebrook fc Bland-Hawthornl 
l|200ll ) for use with t he LDSS slit-mask syst em, building 
on the earlier work of ICuillandre etal.1 (J199J) and the Va 



6 Credit is due to S.Croom for this significant development within 
2dfdr. 

7 Credit for this work is due independently to PhD students 
Emily Wisnioski (WiggleZ) and Hannah Parkinson (GAMA). 



et Vient technique. The premise of N+S observing is to 
perform quasi-continuous observations of both target and 
blank sky (adjacent to the target) through the identical light 
paths, hence preserving the temporal and optical structure 
of the on and off target sky spectra. The principle is iden- 
tical to that of beam switching, but achieves a high chop- 
ping frequency (30 seconds to 2 minute dwell times) without 
incurring the onerous CCD read-out times and read-noise 
associated with classical beam switching at this frequency. 
One therefore requires an observational scheme whereby 
charge can be accumulated on the CCD for multiple in- 
dependent observations, totaling an observational time of 
the order 20-40 minutes (sufficient to reach the sky limited 
regime for observation with the low resolution AAOmega 
gratings of R~I350), and then readout the entire ob- 
servation in a single (2 minute) readout. This mitigates 
the impact of readout noise, while still allowing inde- 
pendent quasi-continuous observations of target+sky posi- 
tion pairs. Much has already been written about the me- 
chanics of CCD charge shuffling |C uillandre et alj Il994l ; 



Idazebrook fc Bland-Hawthornll200ll ; I Abraham et alj|2004l ). 
here we merely note that charge accumulated in one pixel of 
a CCD can be somewhat arbitrarily moved across the CCD, 
essentially instantaneously, with no loss of signal integrity. 

The CCD charge transfer phenomenon is fundamental 
to the basic operation of a CCD, with all charge accumu- 
lated in an ordinary observations essentially being charge 
transferred across the CCD to the readout register(s) before 
readout. Prolonged experiments indicate that, with modern 
CCDs, charge transfer between pixels is essentially loss-less 
(a charge transfer efficiency of 100%) at least in the limit 
of low signal levels (high-signal high-frequency data leaving 
some unfortunate charge transfer inefficiency residuals). 

Furthermore, while a full CCD readout may take 
2 minutes, transferring charge around the CCD is practi- 
cally instantaneous. Hence provided sufficient storage space 
is available on the CCD, one can follow the nod-and-shuffle 
procedure and accumulate charge with the telescope at mul- 
tiple pointings before performing a single CCD readout. 



A2.1 Limitations of Nod- and- Shuffle 

There are a number of immediate limitations which con- 
strain the utility of the nod-and-shuffle process. 

Duty cycle - There is a 50% on-source efficiency hit since 
50% of the time each fibre is pointing at blank sky. This 
can be mitigated through cross beam switching (CBS) as 
outlined below. 

Fibre masking - For the 2dF/AAOmega system, only 50% 
of fibres are available (using the classic N+S technique) 
since the remaining fibres must be masked off to provide 
the storage area needed on the CCD for the charge shuffling. 

Increased Sky noise - Since Sky is observed in the on 
and off position and then a simple image subtraction 
performed, there is no -^/n suppression of noise in the sky 
spectrum as was the case for dedicated sky fibres observa- 
tions, hence there is a \/2 increase in sky noise per spectrum. 

Read noise - There is a \/2 increase in effective CCD 
readout noise contribution due to the subtraction of the 
B position spectrum, which can be significant for shorter 
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observations. 

Telescope overheads - There will be a telescope settling 
time overhead for each telescope nod. Usually telescope 
slew time is insignificant in comparison to the settling time. 
The time scale for variations in the night sky emission i s 
known to be short (jGlazebrook fc Bland-Hawthornll20Q]f) . 
Experimentation with AAOmega nod- and- shuffle indicates 
the best subtraction is achieved with dwell times < 30 sec. 
However, this incurs a significant overhead in terms of 
telescope slew and settle time and an dwell time of 60 sec 
is more typical. Dwell periods in excess of 90-120 sec are 
found to adversely effect data quality. 



A3 Mini-shuffling 

One clear limitation of the basic nod- and- shuffle strategy 
is that one requires significant storage area on the CCD in 
which to hold charge-shuffled spectra during the alternate 
cycle of each observation. With the 2dF/AAOmega system 
this is provided by masking alternate fibres from the spec- 
trograph slit. This is physically achieved through the use 
of an opaque mask attached to the 2dF top end. This oc- 
cults the light to alternate fibres, which have been left in 
the parked position around the edge of the 2dF field plate. 
Attaching the mask is a trivial process, but it does require 
that the telescope be parked at prime focus access between 
observations, which adds 5-15 minutes to the observing cy- 
cle (depending on the location of targets on the sky with 
respect to the due north prime focus access position). 

With a high quality PSF within the spectrograph, one 
is at liberty to reduce the fibre-to-fibre spatial separation 
(the fibre pitch) on the CCD. In the idealised situation, 
each fibre would be separated by many multiples of the 
FWHM of the PSF, however tighter packing is required to 
make reasonable use of the available CCD real-estate. For 
the AAOmega system, a fibre pitch of ~ 10 pixels is imple- 
mented, for a fibre PSF of ~3.2-3.5 pixels (AAOmega ex- 
hibits a broadened PSF in the far red, a consequence of the 
greater penetration of photons into the CCD before detec- 
tion at these wavelengths, which becomes significant due to 
the fast / ~ 1.3 beam of the AAOmega red camera). Hence 
fibres are separated at the Jslir level, allowing a straight for- 
ward (intensity weighted) pixel summation across each fibre 
profil e during extraction, wi th minimal fibre-to-fibre cross- 
talk jSharp fc Birchal]||2010r i. 

With detailed modeling of the spectrograph PSF, one 
can implement an optimal extraction algorithm which simul- 
taneously extracts the profile information for overlapping fi- 
bre profiles, allowing fibres to be p acked more densely onto 
the CCD while limiting cross-talk (|Sharp fc BirchalJ 120101 ; 
iBolton fc SchlegefeOlOT l. The optimal extraction routine im- 
plemented for AAOmega within the 2dfdr data analysis 
suite, while significantly slower than a simple weighted sum- 
mation has be shown to produce ~l-2% c ross talk residuals 
even for fibres with a pitch < 1.5 x FWHM (|Sharp fc Birchalll 
I2010T I. 

Once tight packing of the spectra onto the CCD be- 
comes practical, it is possible to perform nod-and-shufflc 
observations with the full fibre complement. The AAOmega 
system was indeed designed with this mode of operation in 



mind. This mini-shuffling concept allows all 400 AAOmega 
fibres to be used in a single charge shuffling observation. 
With a fibre FWHM profile 3.2 pixels, and a fibre pitch 
of 10 pixels, the interfibre gap is adequate to provide the 
storage zone for B position pixels if an 800 fibre optimal ex- 
traction is then implemented to account for the in evitable 
cross -talk resulting from a fibre separation of ^1-er ([SharpJ 
l2009h . 



APPENDIX B: COMPARATIVE EFFICIENCY 
OF STRATEGIES 

In making a decision on which observing strategy to pursue 
for any given program, one must make a number of base-line 
assumptions with regards to one's program requirements, 
and in addition consider a number of mitigating factors. 

Bl Baseline assumptions 

FoV - One requires the large field of view common to fibre 
fed spectroscopic systems, and observations with a standard 
long slit or slit-mask system cannot achieve the desired goal. 

Target density - Sufficient targets are available per field of 
view to utilise all available fibres (maximise the multiplex), 
i.e. raw target numbers are a significant factor to the 
experiment. 

Figure of merit - Signal-to-noise and target number are the 
figure of merit, not some more abstract (yet important) 
measure of spectral fidelity. 

Fibre availability - Assume no fibres are required for calibra- 
tions, i.e. we ignore the 5-7% of fibres usually allocated to 
simultaneous sky observations. With the AAOmega system 
this number is typically 25 fibres. For large multiplex 
systems we note that due to PSF variations etc., this 
percentage will likely stay constant, requiring a similar 
number of fibres be allocated to sky per unit spectrograph 
rather than per telescope pointing. 

Sky limited observations - The observing strategy is assumed 
to move one out of the read-noise limit, i.e. the error bud- 
get is dominated by Poisson-noise from the sky background. 
When limited by the arrival rate of photons from one's tar- 
get source, one is not likely to require the sky subtraction 
accuracies provided by nod-and-shuffle observations in order 
to remove one's background signal. 

B2 Signal-to-noise implications 

The combined effect on the signal-to-noise ratio, given the 
baseline assumptions, of observing with the nod-and-shuffle 
observing mode is a 2y2 reduction in the signal-to-noise 
ratio achieved for a given integration time. This reduction 
comes from the following sources. 

Exposure time - A factor of 2 is lost in exposure time since 
half of the time must be spent on sky. This is a y/2 loss in 
signal-to-noise. 

Target multiplex - A factor of 2 is lost in the number of 
targets that are observed since one can only use half the 
fibres as the remaining half must be masked in order to 
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provide the storage area into which charge can be shuffled. 
Therefore to recover the same number of targets, one must 
observe two target sets for half of the exposure time each, 
a \/(2) signal-to-noise loss. 

Effective increased sky brightness - There is a factor of two 
increase in the effective sky background, with respect to the 
object's signal, due to the presence of sky emission in both 
the A and B positions. The noise component from the sky 
spectrum can not be suppressed as is the case with dedi- 
cated sky fibres. Essentially the sky is observed for the full 
exposure duration while the target is only observed for half 
of this time. This doubling of the sky signal is a further y/2 
effect in the SN calculation. 



field plate to allow CBS fibre pairs to be allocated. 

Gradients in the pattern of variation in the OH sky lines 
can be mitigated by observing in the classic ABBA pattern. 
This has the added advantage of removing the requirement 
for a significant number of intermediate telescope nods. 

Telescope nods should be performed in the Declination 
(north-south) axis to minimise telescope settling time for 
an equatorial telescope such as the AAT. 



B3 Mitigating factors 

There are a number of strategies which can be used to 
minimise the impact of nod- and- shuffle on the loss of 
signal-to-noise ratio. When all can be applied the loss in 
signal-to-noise can be reduced from 2\/2 to v2- 

Low source density - In the limit of a reduced number of 
valid targets per field, the loss from losing half of the fibres 
is removed. 

Mini- shuffling - The mini-shuffling strategy outlined in 
section IA3I can be used to allow the full fibre multiplex to 
be used. 

Cross-beam switching - In the limit of a reduced number of 
valid targets per field, two fibres can be allocated per target 
to remove the 50% on-source duty cycle effect. Again, the 
mini-shuffling strategy allows the maximum multiplex. 

The increased sky background cannot be mitigated, and 
so the \/2 signal-to-noise reduction per unit time from this 
component remains. However, this is a random error and 
not a systematic effect and so scales with the square root of 
an increased exposure time. 

In circumstances where a gain is found from the N+S 
technique the observing strategy when employed with a 
fibre multi-object spectrograph can deliver sky subtraction 
at the Poisson limit over a wide field of view and with a 
high multiplex. Our recommended observing strategy for 
using N+S with the AAOmega facility at the AAT is as 
follows: 

Configure the target field using either standard nod-and- 
slmffle with fibre masking (200 fibres) or using mini-shuffling 
(400 fibres). 

Allocate fibres to targets in pairs using cross-beam-switching 
(CBS) to alleviate the 50% on source duty cycle. 

Observe in blocks of 20 pairs of A/B telescope nods, with 
60 seconds dwell per position, for individual exposures of 
40 minutes per exposure. 

Perform three/four such integrations per 2dF fibres 
configuration in order to avoid losses from atmospheric 
perturbations on target positions (due to differential atmo- 
spheric refraction across the field). 

Telescope nods should be performed using 60-120 arcsecond 
offsets to minimise telescope slew time while allowing 
sufficient separation between A and B positions on the 2dF 



© 2002 RAS, MNRAS 000.[Tlfl8l 



